Typing fixes and cleanup in estimators.py
#1249
Conversation
* Setting `feature_preprocessors` throws a ValueError; the kwarg is `feature_preprocessor`
* Updated code samples to use `.. code:: python`
* Fixes failing test by setting `random_state` for `make_x` from `sklearn.datasets`
* Uses full path in `setup.py` instead of a relative one
* Added contribution guide
* Fixes not checking for `y_test` being sparse
* Update example on extending data preprocessing with `no_preprocessing`
* Update a link for the `scenario` argument to point at the new SMAC documentation
* Fixes old references to HPOlibConfigSpace, updating them to ConfigSpace
Need to check if calling fit twice will work. Edit: Done.

```python
    'mlp',
]
self.include['classifier'] = include_estimators
self.automl_._include['classifier'] = include_estimators
```
This is due to `automl_` no longer being built during `fit`; we have to set this manually, as we do for the estimator.
```python
self.metric = log_loss

self.automl_._metric = self.metric
```
Similar here
```python
# Set the variables in the SklearnEstimator.automl
self.automl_._resampling_strategy = resampling_strategy
self.automl_._resampling_strategy_arguments = resampling_strategy_kwargs
self.automl_._get_smac_object_callback = smac_callback
```
And here
Note: Built from PR Fix sparse y test #1248; should be considered after that.
Note 2: Rebasing pulled some other PRs into this -_-
Main Change

This PR fixes some typing issues that were present. Notably, the `self.automl_` attribute is now created during `__init__()` rather than on the first call to `fit`, removing the `_build_automl()` function. This was primarily to get rid of the typing warning on `self.automl_.predict` (`None has no attribute predict()`), as mypy can not establish that `self.automl_` had been set in any methods. This PR also does some other small typing cleanups:

* `Union[SUPPORTED_TARGET_TYPES, spmatrix]` -> `SUPPORTED_TARGET_TYPES`
* `self.per_run_time_limit` is now set at construction
* `AutoSklearnEstimator` is now an `ABC`; it relies on subclasses providing two `@abstractmethod`s, `_supported_target_types` and `_automl_cls`, so that validation only has to occur once and not in each subclass
* Removed `predict_proba` from `AutoSklearnEstimator`; this should only be available in the subclass `AutoSklearnClassifier`
* Removed `AutoSklearnRegressor.fit` so it just uses the base class `fit` method. It was doing nothing but forwarding arguments.
* `AutoSklearn2Classifier` now explicitly sets attributes on the already constructed `self.automl_`, where previously it relied on it being constructed during `fit`
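As a rough illustration of the restructuring described above, here is a minimal, hypothetical sketch: the class and method names mirror the PR's description, but this is not auto-sklearn's actual code.

```python
from abc import ABC, abstractmethod


class AutoML:
    """Stand-in for the wrapped automl object (hypothetical)."""

    def fit(self, X, y):
        self.fitted_ = True
        return self

    def predict(self, X):
        return [0] * len(X)


class BaseEstimator(ABC):
    def __init__(self):
        # automl_ is built during __init__ rather than on the first
        # fit() call, so its type is never Optional and mypy can no
        # longer warn "None has no attribute predict()".
        self.automl_ = self._automl_cls()()

    @abstractmethod
    def _automl_cls(self):
        """Return the AutoML class this estimator wraps."""

    @abstractmethod
    def _supported_target_types(self):
        """Return the target types this estimator accepts."""

    def fit(self, X, y):
        # Validation against _supported_target_types would happen
        # once here, not re-implemented in each subclass.
        self.automl_.fit(X, y)
        return self

    def predict(self, X):
        return self.automl_.predict(X)


class Classifier(BaseEstimator):
    def _automl_cls(self):
        return AutoML

    def _supported_target_types(self):
        return ("binary", "multiclass", "multilabel-indicator")
```

Because the base class is an `ABC`, forgetting either abstract method in a subclass fails loudly at instantiation time rather than at first use.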
Extra Context
In general, as estimators are built in a two-stage process, during `__init__` and then remaining variables in `fit`, we end up with many attributes set to `None` during the `__init__` phase. This causes problems with mypy's typing, as it can not guarantee that they will not be `None` when they are used later. There are two solutions to this I can think of:

* Move any attributes we can create to `__init__` from `fit`. In general this is probably better practice and safer.
* Move attributes to a `@property`.

The second solution works fine when we can use the combo `@property attribute_wanted` with `self._attribute_wanted`. It has an issue when we want `@property _attribute_wanted` and `self.__attribute_wanted`. While we could have the reference as `__attribute_wanted`, with the double leading underscore, this invokes Python's name mangling: a subclass will not be able to access the attribute directly and will be required to go through the `@property _attribute_wanted`. This may or may not be a problem.
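A small self-contained sketch of that name-mangling caveat (the names here are illustrative, not from the PR):

```python
class Base:
    def __init__(self):
        # Double leading underscore: Python mangles this name to
        # _Base__value at compile time, scoped to the defining class.
        self.__value = 1

    @property
    def _value(self):
        return self.__value  # resolves to self._Base__value


class Child(Base):
    def direct_access(self):
        # Inside Child, self.__value mangles to _Child__value,
        # which was never set, so this raises AttributeError.
        return self.__value

    def via_property(self):
        # Subclasses must go through the property instead.
        return self._value


c = Child()
print(c.via_property())  # -> 1
try:
    c.direct_access()
except AttributeError:
    print("direct access fails under name mangling")
```

So choosing the `@property _attribute_wanted` / `self.__attribute_wanted` pairing effectively makes the backing attribute private to the defining class, which is exactly the trade-off described above.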